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ABSTRACT 



Every experiment is affected by systematic effects that hamper the data analysis and have the potential to ultimately 
degrade its performance. In the case of probes of the cosmic microwave background (CMB) radiation, a minimal set 
of issues to consider includes asymmetric beam functions, correlated noise, and incomplete sky coverage. Presuming 
a simplified scanning strategy that allows for an exact analytical treatment of the problem, we study the impact of 
systematic effects on the likelihood function of the CMB power spectrum. We use the Fisher matrix, a measure of the 
information content of a data set, for a quantitative comparison of different experimental configurations. In addition, for 
various power spectrum coefficients, we explore the functional form of the likelihood directly, and obtain the following 
results: The likelihood function can deviate systematically from a Gaussian distribution up to the highest multipole 
values considered in our analysis. Treated exactly, realistic levels of asymmetric beam functions and correlated noise do 
not by themselves decrease the information yield of CMB experiments nor do they induce noticeable coupling between 
multipoles. Masking large fractions of the sky, on the other hand, results in a considerably more complex correlation 
structure of the likelihood function. Combining adjacent power spectrum coefficients into bins can partially mitigate 
these problems. 

Key words, cosmic background radiation - Methods: data analysis - Methods: statistical 



1. Introduction 

Owing to its sensitive shape dependence on the most fun- 
damental cosmological parameters, the power spectrum of 
the cosmic microwave background (CMB) radiation is one 
of the m ost important pro bes of the properties of the early 
universe (Mukhano3[2004). As a consequence, a large num- 
ber of experiments have been designed to improve the 
observationa l cons t raints on the CMB an i sotropics (e.g.. 
Smoot et all 119921: [i^ tterneld ^eTall 120021: iBennett et al 



2003; iKuo et al.l 120041: ID unklev et al.l 120111: iKeisler et al . 
2011; Planc k Collaboration et al.l 1201 ll ). With the increas- 



ing precision of observations, the demands on the numerical 
tools used for data analysis became more and more strin- 
gent. Given the data obtained with the Planck satellite, for 
example, the temperature power spectrum will be cosmic 
variance limited from the l argest scales up to a m ultipole 
moment of about £ w 2500 (jEfstathiou et al.ll2QQ5h . To take 
full advantage of this wealth of information, a precise un- 
derstanding and thorough modeling of the data properties 
are necessary. 

The Fisher information matrix is ubiquitous in statis- 
tics. Used in the context of the Cramer-Rao bound, it per- 
mits a quantitative assessment of an experiment's ability 
to constrain a set of parameters. Furthermore, it also al- 
lows drawing conclusions about parameter degeneracies. 
Owing to its predictive power, the Fisher matrix has been 
widely used to characterize and optimize experiments in 
various fields of astronomy (for example, in cosmology, e.g., 



| Hu fc Tegma rk 1999: ]Berti et al.l 120051 : [Rimes fc Hamilton! 
2005; iLee fc Pe n 2008). Since the computational costs as- 
sociated with its numerical calculation are prohibitively 
large, such studies have not b een realized for general h igh- 
resolution CMB experiments. lElsner fc~ Wandelt (|2012l ) in- 
troduced a method for an efficient yet exact calculation 
of the Fisher matrix and of the full likelihood in only 
0(^ ax ) operations. This mathematical framework enables 
a quantitative study of several systematic effects on CMB 
power spectrum constraints. In this work, we focus on com- 
mon systematics like asymmetric beam functions, corre- 
lated noise, partial sky coverage, and the effect of binning. 
For different experimental setups, we analyze the correla- 
tion structure of the power spectrum coefficients and ex- 
plore the shape of the likelihood function itself. 

Cosmological parameter estimation from a power 
spectrum is commonly perfo rmed by means of Monte 
Carlo sampling techn iques (e.g. lChristensen fc Mey er 2000; 
iLewis fc Bridld 12002). Owing to the costs associated with 
the numerical evaluation of the full likelihood function 
given the data map, exact methods are re stricted to the 
lowes t multipole moments, l ma ,x ^ 50 (e.g. iHinshaw et al.l 
120071 : iGruDDUso endl 120091 ). Beyond that, the likelihood 
function is usually written in terms of the power spec- 
trum coefficients themselves, a considerably less demand- 
ing problem. To do so, the function has to be approximated 
by, e.g., a multivariate Gaussian distribution (as done for 
the analysis of the Atacama Cosmology Telescope data, 
IDunklev et aIll201lD . or some other choice, such as a combi- 
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nation of a Gaussian and an offset log-normal d istribution 
(as done by the WMAP team. IVerde et al.ll2003l ). Using our 
methods, we are able to give examples of generic features 
of the true underlying likelihood function which should be 
reproduced by these approximations to avoid introducing 
systematic biases in the cosmological parameter determi- 
nations. 

Since our method allows an exact modeling of the data 
in its full complexity, we can address a new question about 
how different instrumental and experimental properties af- 
fect the information content of a CMB data set: rather 
than asking how systematics affect results if they are not 
taken into account optimally (see, e.g . , the discussion of 
correlated noise in Del abrouilld [l99 8: Main o et all I1999L 
asym metric beams in iBurigana fc Sa ez 2003; Mit ra et al.l 
201 l j and partial s ky coverage in | Szapudi et al.l |2001; 
Wan delt et all 120011 : iHivon et al. 2002), we can determine 
to what extent these systematics fundamentally degrade 
power spectrum constraints even after an analysis that 
takes their effects into account optimally. In all the analyses 
we present in this paper, the instrumental characteristics of 
the simulated experiment (such as the noise model and the 
asymmetric beam maps) are assumed to be known exactly. 
To make this assumption true for realistic experiments re- 
quires a careful calibration strategy. Therefore, the results 
we discuss should be seen as hard lower limits on the effects 
of systematics on CMB power spectrum inferences. 

The paper is organized as follows. In Sect. [2j we briefly 
review the mathematical background used for our study. 
We then assess the impact of important experimental sys- 
tematics on the likelihood function (Sect. [3]). Finally, we 
summarize our findings in Sect. HI 



2. The likelihood analysis on the ring torus 

Consider an arbitrary CMB experiment with Gaussian 
noise, the likelihood function C of the data d is 



C(C e \d) 



1 



y/\2ir(S(C t )+VS)\ 



x exp 



-Idt (S (C £ ) + N)- 



(1) 



where we have introduced the signal covariance matrix, S, 
as a function of the CMB power spectrum coefficients Cg. 
The noise properties are characterized by the noise covari- 
ance matrix, N. 

In the most general case, the evaluation of Eq. (pQ) takes 
0(N 3 ) operations, where N is the number of pixels in 
the survey (|Borrilllll999h . To enable a numerically feasi- 
ble analysis, we only consider experiments with an ideal- 
ized scanning strategy, where the sky is observed on iso- 
latitude circles. The mathemat ical background of t he al- 
gori thm is explained in detai l in I Wandelt fc Hansen! (2003) 
an sner fc Wandeltl (|2012f ). Following their approach, we 
then map the time-ordered data (TOD) onto the ring torus. 
We visualize this duality in Fig. [TJ Casting the equations 
into Fourier space allows us to take advantage of the peri- 
odicity of the manifold - the expression for the likelihood 
function simplifies. 

To obtain the signal covariance matrix in Fourier basis, 
we specify the noise-free sky temperature as a function of 




Fig. 1. Idealizing the scanning strategy leads to additional 
symmetries. Observing the sky on iso-latitude rings with a 
regular spacing in azimuthal direction (left panel) allows to 
map the resulting data structure onto a ring torus (right 
panel). Then, the formulation of the likelihood function in 
Fourier basis simplifies. 



the spherical harmonic coefficients of a signal map, a^ m , 

Trp = ^2 a£r ^rp(^s) Xt p . (2) 



Here, the index p specifies the Fourier modes in the in-ring 
direction, and r labels the index for the cross-ring direction. 
In Eq. ([2]), we make use of the definition of the Wigner 
rotation matrix to introduce the real quantity d, 



D e mm ,{<h,6,4>i)=e- im ** C(«)e- 



(3) 



which we evaluate at the latitude of the experiment's spin 
axis, = S . We also apply the rotated beam X according 
to 



Xfm. — 



2£ + l 

47T 



(4) 



where the Wigner small d matrix is now computed at the 
opening angle of the scanning circles, 9 Q . The spherical har- 
monic expansion coefficients of the beam pattern, 6^ m , are 
taken into account exactly and need not to be azimuthally 
averaged. The signal covariance matrix (Tf p Tfi*,) then sim- 
plifies, 

Qrpr'p' — S rr f 



In Fourier space, S shows a block diagonal structure. 

We now specify the noise correlations. For an experi- 
ment with stationary noise, we can fully characterize its 
properties by a power spectrum P(k) in the TOD domain. 
This ansatz is general enough to include the exact treat- 
ment of, e.g., l// 2 -type noise, a common systematic in real- 
world experiments. The noise covariance matrix {T^ p T^ p 1) 
now reduces to 



N n -1 



INL 



1 (pm—p'm') 



m,m'—Q 

N r -1 

X 

A=0 



^e-® Ar C(A,m,m'), (6) 
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where 7V P is the number of pixels per ring, and N r is the 
number of rings in the data set. We introduced an auxiliary 
function C(A, ra, ra') according to 



N-l 

C(A,ra,ra') = P ( k ) e ~ 

k=0 



ik(N T A+m—m') 



(7) 



In a Fourier space representation, we also find the noise 
covariance matrix N to be block diagonal. 

We now review an efficient scheme to calculate the 
Fisher information matrix. Defined as th e covariance of the 
score function, from Eq. (pQ), we obtain ([Tegmarklll997l ) 



d 2 ln£ 

dC £l dC i2 

-a [c-^^c- 1 ^ 2 ] 



(8) 



where C = S + N, and P £ = dC/dCe. For the derivative 
of the covariance matrix with respect to a power spectrum 
coefficient, we find for each matrix block r, 

dC r \ __ fdS r \ 
9Ci) ppf ~\dcJ pp/ 



N 2 d e rp Xe p d e rp ,Xp p , 

Hrp Hrp' ' 



(9) 



On the ring torus, each block in Eq. © is a rank one ob- 
ject and can be decomposed according to Pf = qf, q^ . The 
calculation of the Fisher matrix therefore simplifies, 



(10) 



For completeness, we also provide the expression for the 
score function on the ring torus, 



dln£ 1 



dC t 



2^ 



(Mtc^Yf-^tc-V) , (ii) 



which, just as the Fisher matrix, can also be calculated 
efficiently owing to its block diagonal structure. 

To evaluate the equations numerically, we developed 
a hybrid OpenMP/MPI implementation of the algorithm. 
The intrinsic parallelism of the method is reflected in a good 
scaling behavior for up to 10 000 CPU cores and more. 

3. The role of systematic effects 

For a detailed study of various effects, we directly compare 
a reference simulation, with a simple setup, to a second data 
realization where we included another layer of complexity. 
Although we only consider an idealized experiment with 
simplified scanning strategy, this concurrent analysis allows 
us to draw conclusions about the general relevance of the 
effect under study. 

For our benchmark simulation, we used the 
WMAP7+BAO+H0 cosmological parameters 

([Komatsu et al.l l201ll ) to generate a CMB tempera- 
ture map, up to a maximum multipole moment of 
tax = 1023. According to Eq. (|2]), we sampled the 
signal regularly on 2048 2 points on the sphere, i.e., 
N p = N r = 2£ ma x + 1 = 2048. The scanning rings had 



an opening angle of o = 40° at a latitude of S = 70°, 
leading to a sky coverage of about f sky = 60 %. For the 
beam function, we used a symmetric Gaussian profile with 
a full width at half maximum (FWHM) of 12'. In the 
noise model considered here, we parameterized the noise 
power spectrum in the TOD domain by means of a simple 
function, 

P(k) = a 2 [1 + (k knee /k) a ] , (12) 

allowing to include l// a -type correlations. For the bench- 
mark, we added a noise realization drawn from a white 
noise power spectrum (i.e., a = 0) with a fixed amplitude 
a = 10 mK. At this level, the experiment is signal domi- 
nated up to the highest multipole moment considered. 

We base our analysis on a twofold approach: on the 
evaluation of the likelihood function, and on the Fisher in- 
formation matrix. Starting from the fiducial CMB power 
spectrum, we use Eq. (fTU|) to calculate the Fisher matrix 
for a given experimental setup. The marginalized 1-a er- 
ror bars of the power spectrum coefficients are then given 
by <Jc £ = v though, strictly speaking, this rela- 

tion only holds for multivariate Gaussian distributions. We 
define the normalized correlation matrix of the power spec- 
trum coefficients according to 



= (J ? " 1 k/,/# = VFS - hit, , (i3) 

where we forced the diagonal elements to zero. As an in- 
dependent test, we probe the shape of the likelihood func- 
tion directly. To this end, we start from the fiducial power 
spectrum and find t he likelihood pe ak by means of Newton- 
Raphson iterations ([Oh et al.lll999f h 



i +1 = Q-lJ2( F ~ r ) 



d\nC 

dC v 



(14) 



We then calculate likelihood slices by varying one power 
spectrum coefficient around the best fit value, while keep- 
ing all others fixed. This approach explores the functional 
form of the likelihood without the need to rely on approxi- 
mations. 



3.1. The benchmark 

In Fig.[2j we show the estimated maximum likelihood power 
spectrum of the reference simulation, Ci, and compare it to 
the input we adopted to synthesize the CMB anisotropics. 
Approximating the likelihood as Gaussian around the peak, 
we can calculate the goodness of fit using the Fisher matrix 
F according to 



X 



(C £ -C} npu yF(C £ -C £ Lnpui ). 



Input \ 



(15) 



Since monopole and dipole are excluded from the fit, we set 
^Vdof = tax — 1, and obtain xV^dof = 1-09. We conclude 
that the fiducial model is consistently recovered. The Fisher 
matrix, depicted in the right hand panel of Fig. [2j exhibits a 
non-trivial shape with noticeable deviations from a simple 
diagonal matrix. 

To characterize the correlation structure of the power 
spectrum, we plot the covariance matrix in Fig. [3] In this 
diagram, rows or columns at low multipole moments ap- 
pear to be dominating. However, this effect is the result of 
the approximate t~ 2 scaling of the CMB power spectrum. 
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We therefore use Eq. (fT3|) to normalize the covariance ma- 
trix for an unbiased assessment of the correlation structure 
and find no evidence for an enhanced coupling of small and 
large scale perturbation^]. Instead, the normalized correla- 
tion matrix is dominated by its diagonal elements which we 
illustrate at low, intermediate, and high multipoles in the 
right hand panel of Fig. [3] Obviously, a power spectrum co- 
efficient Ct shows a substantial negative correlation at the 
10 % level with its direct neighbors Cg±\ and C^±2. Longer- 
ranging correlations are suppressed to below 1 %. As the 
signal-to- noise ratio (S/N) is large in the entire multipole 
range considered here, the evolution of the correlation co- 
efficients towards higher i is negligible. 

It has been noted before, that the likelihood func- 
tion is strongly non- Gaussian at the lowest m ultipo le mo- 
ments (e.g., iBond et al.l 119981 : IWandelt et al.ll2004D . As a 
consequence, exact sampling methods are commonly used 
for the data analy sis in this regime ([Hinshaw et al.l [2009: 
iJarosik et al.|[201l| ). However, and in contrast to previous 
claims in literature, we also find a noticeable deviation from 
Gaussianity at the highest multipole values. We illustrate 
the shape of the likelihood function for large, intermediate, 
and small scale perturbations in Fig. SJ For all power spec- 
trum coefficients, a fit with a simple Gaussian distribution 
leaves residuals at the level of several percent. Combining 
the individual errors for each multipole moment, the total 
error on the likelihood function can be substantial if it is 
modeled by a simple multivariate Gaussian. 

3.2. The effect of asymmetric beams 

At some level, all CMB experiments suffer from beam im- 
perfections, depending on detector design and focal plane 
geometry. For the low frequency instrument aboard the 
Planc k satellite, for exampl e, the beam ellipticity can reach 
40 % (Me nnella et"alll201l[ ). We now study whether asym- 
metric beams by themselves pose a problem in a sense that 
they lead to a loss of information or induce additional cor- 
relations between power spectrum coefficients. Again, we 
study the impact of asymmetric beams on the information 
content that could ultimately be obtained after an optimal 
analysis of the data. 

To this end, we repeated the full likelihood analysis on a 
data set generated with the simulation parameters as above, 
except for the fact that we now included an asymmetric 
Gaussian beam with an axes ratio of cr x /<j y = 2 : 1. We 
kept the geometric mean of the FWHM along the major 
and minor axis fixed at 12'. In Fig. [5j we compare a slice 
through the covariance matrix of the simulation against 
our benchmark model. The relative difference of the entries 
close to the dominating diagonal elements is smaller than 
1 %. We conclude that an asymmetric beam per se, if ac- 
counted for exactly in the analysis, neither degrades the 
information content of the experiment, nor complicates the 
correlation structure of the power spectrum coefficients. 

3.3. The effect of the noise properties 

We now characterize the impact of different noise proper- 
ties on the likelihood function. To be more precise, we first 

1 We note that pseudo-CV power spectrum estimators are 
known to introduce spurious corr elations between small and 
large multipole moments (see, e.g., Efstathiou 2004). 




Fig. 5. Asymmetric beams do not destroy information. 
Upper panel: For a multipole moment of £ = 500, we show 
a slice through the covariance matrix for the reference sim- 
ulation with symmetric beam. Middle panel: The relative 
difference between the covariance matrices calculated with 
symmetric beam (F _1Ref ) and strongly asymmetric beam 
(F _1Asym ) is only of the order 1 %. Lower panel: Beam 
profiles used in the comparison. 



vary the overall noise level without modifying its power 
spectrum, and then analyze the effect of correlated noise. 
The data sets used for this study are visualized in Fig. [6j 

3.3.1. Low S/N likelihood 

For the analysis of the effect of noise on CMB power spec- 
trum estimation, we increased the noise power spectrum 
amplitude in Eq. ([T2]) to a = 200 mK. In this setup, we ob- 
tained a S/N of unity at a multipole t « 700. As expected, 
the diagonal elements of the inverse Fisher matrix reflect 
the increase in noise (Fig. [?])• We also find the correlation 
structure of the Fisher matrix change qualitatively at high 
multipoles, it becomes stronger and more complex. In par- 
ticular, the negative correlation between a power spectrum 
coefficient Ct and Ct±$ increases in strength. The full corre- 
lation matrix, plotted on logarithmic scale to highlight the 
off-diagonal entries, is depicted in Fig. [U Becoming more 
and more relevant with the decreasing S/N towards higher 
the inhomogeneously distributed noise induces additional 
long-ranging correlations between power spectrum coeffi- 
cients at the sub-percent level. We show likelihood slices in 
Fig. [9j When compared to the high S/N case in Fig. HJ the 
analysis reveals a widened distribution at high multipoles. 
Besides that, the non-Gaussian features of the likelihood 
function remain basically unchanged. Only a further de- 
crease in the S/N starts to diminish the deviation from a 
Gaussian shape. In that case, however, the power spectrum 
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Fig. 2. Left panel: The result of an exact maximum likelihood power spectrum estimation (filled circles) shows that 
the input power spectrum can be consistently recovered (solid line). The gray area indicates the 2-a confidence region 
as derived from the Fisher matrix. Right panel: The Fisher matrix of the CMB power spectrum coefficients cannot be 
approximated well by a diagonal matrix. 




Fig. 3. The correlation structure between power spectrum coefficients is complicated. Left panel: We plot the inverse of 
the Fisher matrix, i.e. the covariance matrix of the experiment. For illustrative purposes, we inverted the x and y axes. 
Right panel: Patches around the diagonal elements of the normalized correlation matrix at low (t = 200, upper row), 
intermediate (t — 500, middle row), and high multipoles (t — 800, lower row) reveal significant negative correlations 
between neighboring power spectrum coefficients at the 10 % level. 



coefficients have too large an error bar for them to play a 
substantial role in the estimation of cosmological parame- 
ters. 

3.3.2. Correlated noise 

Next, we study the effect of correlated noise. To this end, we 
generate a simulation with parameters as follows. Choosing 
a noise realization with a = 2 in Eq. ([T2]h we selected a 
knee frequency of k^ nee = 10 -3 in terms of the Nyquist 
frequency, and set the DC mode to zero. Red noise is often 
present in experiments to some level, and it also affects 
the high frequency instrument data of the P lanck satellite 
mission (jPlanck HFI Core Team et alJ2011h . Here, we kept 
the overall noise power spectrum amplitude at a = 200 mK, 



the same level as in the previous scenario. Owing to the 
additional long-range correlations, the noise variance per 
pixel in the data map increased considerably (cf. Fig. [6]). 

The analysis based on the Fisher matrix is shown in 
Fig. [I0l where we plot the error bars and the correlation 
matrix of the experiment. The impact of the red noise on 
the likelihood function is negligible when it is accounted 
for exactly in the analysis. The additional low- frequency 
noise contribution, most relevant for large-scale perturba- 
tions, still falls within a highly signal dominated regime. 
As a result, its effect on the power spectrum estimation 
can be removed efficiently. Only for the smallest multipole 
moments, £ ^ 5, we observe an increase in the error bar by 
no more than 3 %. Accordingly, the correlation structure 
between power spectrum coefficients is virtually unaffected. 
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Fig. 4. The likelihood is non-Gaussian. Upper row: Varying the power spectrum coefficients for different multipole values, 
we plot slices of the likelihood function (solid lines) at low (i = 200, left panel), intermediate (i = 500, middle panel), 
and high multipoles (t = 800, right panel). We also show the best fitting Gaussian distributions (dashed lines). Lower 
row: The absolute residual between the likelihood function and its Gaussian fit is of the order of several percent. 
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Fig. 6. Data sets used to study the impact of noise. Left panel: We show the data of the low S/N simulation with 
uncorrelated noise projected onto an unfolded torus. The scan on a single ring proceeds towards larger values of 02, the 
ring number increases along the horizontal axis (4>i). Right panel: Low S/N simulation, additionally containing l// 2 -type 
noise. Strong correlations extending well beyond individual rings are visible. 



3.4. The effect of partial sky coverage 

Excluding large parts of the sky from the analysis has the 
potential to severely complicate power spectrum estima- 
tions. This problem roots in the mathematical properties 
of the spherical harmonics. Restricted to a subdomain of 
the sphere, the functions do no longer fulfill an orthonor- 
mality relation. As a result, correlations between the CMB 
power spectrum coefficients are induced. 

All ground based C MB experiments observe only abac- 
tion of the sky (e.g., iKosowskvl l200l iRuhl et all I2004D . 
But even for satellite-borne experiments, the contamina- 
tion by secondary sources necessitates to mask large sky 
regions in favor of an unbiased CMB power spectrum es- 



timation (e.g., iJarosik et al.l l201l[ ). To further study the 
effect of restricting the observations to a small part of 
the sky, we analyzed a simulation with a reduced cover- 
age of / s ky = 40 %. We chose the simulation parameters 
of our benchmark data set, except for the opening angle 
of the scanning rings, which we reduced to O = 25°. As 
we still sampled the data with the same number of nodes 
(7V p = N r = 2048), we slightly adapted the noise amplitude 
to the finer grid to obtain the same S/N in the power spec- 
trum, a = ^60 %/40 % • 10 mK « 12.2 mK. In Fig. HQ we 
visualize the normalized correlation matrix. A comparison 
to the right hand panel of Fig. [3] shows an increase in the 
amplitude of correlation by about a factor of two. In addi- 
tion, an even/odd asymmetry becomes visible as the power 
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Fig. 7. Left panel: We compare the Fisher matrix based prediction of the 1-a error bars of the power spectrum coefficients 
between our benchmark (dashed line) and the simulation with lower S/N and uncorrelated noise (solid line). Right panel: 
As a plot of the correlation matrix confirms, an increase in noise towards higher multipoles strengthens the amount of 
correlation and complicates the correlation structure (cf. Fig. [3j). 




0.4 1.0 1.6 0.4 1.0 1.6 0.4 1.0 1.6 



r< I ri Best r< l Best In Best 

^£ = 200 / ( ~^ = 200 W = 500 / ^£ = 500 W = 800 / u £ = 800 

Fig. 9. Same as Fig. |4j but for a simulation with low S/N. Simply increasing the error bars, additional noise, most 
noticeable at the highest multipole moment, largely preserves the non-Gaussian features of the likelihood function. 



spectrum coefficients Ci are now negatively correlated with 
d±s at the 12 % level, whereas the amount of negative 
correlation for Ci±i decreases to 5 %. We plot slices of the 
likelihood in Fig. [I2j The loss of information associated 
with a reduction of sky coverage leads to widened distribu- 
tions, most noticeable at large angular scales. Interestingly, 
the non-Gaussian pattern in the likelihood function are re- 
markably robust and remain at about the same level. 

3.5. The effect of binning 

As established in the previous paragraph, a large galac- 
tic cut induces significant correlations between neighboring 
power spectrum coefficients. Furthermore, the likelihood 
function is non- Gaussian to a noticeable extent. As the the- 
oretical power spectrum is a smooth function of only a few 
cosmological parameters, combining the observed power 
spectrum coefficients into bins is a common practice in pa- 
rameter estimation. This approach reduces scatter in the 
data points and has the potential to simplify the correla- 



tion structure. Here, we analyze the effect of binning on the 
likelihood function. 

We chose a bin width of A£ = 10 and repeated 
the analysis of our reference simulation. In Fig. [I3j we 
plot the correlation structure of the likelihood function. 
Neighboring power spectrum bins are still negatively cor- 
related, although at a smaller amount of about 5 %. The 
bin after next, however, is almost completely decorrelated. 
Furthermore, the shape of the likelihood function, shown in 
Fig. [I4j can be much better described by a Gaussian dis- 
tribution, where the residuals of a fit have decreased to a 
1 - 2 % level. Obviously, binning has the desirable side ef- 
fects to simplify the shape and correlation structure of the 
likelihood function. 



4. Discussion and conclusions 

Owing to its reduced size compared to a full data map, 
the power spectrum of the CMB radiation anisotropics is a 
convenient intermediate stage product in the process of es- 



7 



Franz Eisner and Benjamin D. Wandelt: Likelihood, Fisher information, and systematics of CMB experiments 




i 4 



Fig. 10. Same as Fig. but for the simulation containing noise with additional large-scale correlations. The error bars 
remain at the same level. Only at the largest scales, £ < 5, a mild loss of information is apparent. Changes in the 
normalized correlation matrix are negligible. 
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Fig. 12. Same as Fig. |4j but for a simulation with larger mask. Lowering the sky coverage from 60 % to 40 % leads to 
less stringent constraints on the power spectrum coefficients, but has a negligible effect on the amount of deviation of 
the likelihood function from a Gaussian distribution. 
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Fig. 14. Same as Fig. HJ but for a binned power spectrum. Combining adjacent power spectrum coefficients results in a 
more Gaussian probability distribution. 
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Fig. 8. An increased level of noise, distributed inhomoge- 
nously over the sky, introduces additional long-range cor- 
relations between power spectrum coefficients at the sub- 
percent level. 
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Fig. 13. Binning simplifies the correlation structure. We 
show elements of the correlation matrix for a binned power 
spectrum with a width of A£ — 10. Directly neighboring 
bins are still negatively correlated to a noticeable extent 
(cf. Fig. 13 . 




Fig. 11. A small sky coverage induces complicated corre- 
lations. We plot the normalized correlation matrix for a 
simulation covering / s k y = 40 % of the sky. A larger mask 
increases the correlation in strength and range (cf. Fig. [3]). 

timating cosmological parameters. Sufficient to completely 
characterize a Gaussian random field, it contains all of the 
most fundamental cosmological information present in the 
data (jJungman et al.lll996UBond et al.lll997l ). To construct 
the probability distributions of the parameters out of the 
power spectrum requires a precise modeling of their likeli- 
hood function. That is, one has to specify the shape of the 
function, and the correlation between the individual power 
spectrum coefficients. 

Systematic effects, present in all real- world experiments, 
have the potential to reduce the information content of a 
data set. In the context of a CMB power spectrum analysis, 
issues may arise from beam imperfections, incomplete sky 
coverage, and correlated noise. Here, we conducted a sys- 
tematic study of the impact of these effects on the ability to 
constrain the CMB power spectrum. Assuming a simplified 
scanning strategy that allows for an exact analytical treat- 



ment of the problem, we used the Fisher matrix formalism 
for a quantitative comparison of the information content 
of different experimental setups. To complement the anal- 
ysis, we also explored the shape of the likelihood function 
directly by varying the amplitude of individual power spec- 
trum coefficients. We obtained the following results: 

— For a high S/N experiment that covers 60 % of the sky, 
we found neighboring power spectrum coefficients to be 
negatively correlated at the 10 % level. Interestingly, 
the shape of the likelihood function deviated noticeably 
from a Gaussian distribution, typically by 2 - 5 %. This 
was true even for the highest multipole values probed 
in our analysis, i > 1000. 

— Increasing the noise contribution changed the correla- 
tion structure between different power spectrum coef- 
ficients. It became somewhat more pronounced and of 
longer range. The residuals of a fit of the likelihood func- 
tion with a Gaussian distribution remained basically the 
same. 

— Treated exactly, the mere presence of correlated noise 
did not lead to a loss of information. Comparing two 
simulations with white and colored noise power spec- 
trum, we found l// 2 -type noise to be efficiently re- 
moved. In addition, it did not increase the amount of 
correlation among the power spectrum coefficients. 

— As expected, the impact of a reduced sky coverage on 
the analysis was most important. To study this effect, 
we further limited the observed region to 40 % of the 
sky. As a result, the negative correlation between power 
spectrum coefficients increased to up to 18 %. The 
strength of deviation of the likelihood function from 
Gaussianity, on the other hand, was mainly unaffected. 

— If power spectrum coefficients were combined into 
bins, the correlation structure among them simplified. 
Typically, only adjacent bins remained correlated at a 
noticeable level. Furthermore, the shape of the likeli- 
hood function became more Gaussian. 
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— Finally, if asymmetric beams were present but could be 
accounted for exactly in the analysis, the Fisher matrix 
remained virtually unchanged compared to an equiva- 
lent experiment with symmetric beams. That is, asym- 
metric beams by themselves do not destroy information. 

As a caveat lector, we re-iterate that we calculate an up- 
per bound on the information content provided in these dif- 
ferent experimental scenarios assuming that all instrument 
properties are known well enough that any small remain- 
ing uncertainty does not affect these results. This reveals 
that the quality of a CMB power spectrum measurement 
does not depend in principle on symmetric beams or the ab- 
sence of correlated noise. Recovering the information may 
require sophisticated analysis and sufficient knowledge of 
the instrument parameters. Nevertheless, this finding might 
inspire novel, cost-effective experimental designs that em- 
brace beam asymmetries and/or correlated noise as long as 
these effects are well-calibrated. It would be interesting in 
follow-up work to quantify the maximally allowable cali- 
bration uncertainties for these statements to hold. 

On a more practical level our results are useful in anal- 
ysis of real data sets in several ways: first we point out the 
level of non-Gaussianity in the power spectrum likelihood 
that remains even for relatively large sky coverage and at 
high i. We also illustrate the typical correlation structure 
for measurements with an anisotropic noise distribution. 
Third, these benchmarks help determine the point of di- 
minishing returns, i.e. when it no longer pays off to refine 
the data analysis of a real data set. That point is reached 
when the information recovered from the data set reaches 
a substantial fraction of the theoretical information bound 
we give here. 
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